Corpus: fas_web_2012_30K


4.4.1.5 Number of Word-N-grams at Sentence Endings

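Number of distinct word N-grams (N = 1...5) at sentence endings, counted over the first K sentences. The last two rows are identical, presumably because the corpus holds fewer than 100000 sentences (its name suggests about 30K), so no new N-grams appear beyond that point.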

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          65            95             98            98            99
   1000         437           864            995           997           999
  10000        2624          7287           9558          9899          9967
 100000        5805         19127          27383         29300         29692
1000000        5805         19127          27383         29300         29692
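
The counting behind such a table can be sketched roughly as follows. This is only an illustration, not the procedure actually used for this corpus: the function name `count_ending_ngrams`, the whitespace tokenization, and the choice to skip sentences shorter than N words are all assumptions.

```python
def count_ending_ngrams(sentences, k, max_n=5):
    """Count distinct word N-grams (N = 1..max_n) that end the first k sentences.

    Hypothetical helper: tokenization by whitespace is an assumption,
    and a sentence with fewer than N words contributes no N-gram.
    """
    seen = [set() for _ in range(max_n)]  # one set of N-grams per N
    for sentence in sentences[:k]:
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:
                seen[n - 1].add(tuple(tokens[-n:]))  # last n words
    return [len(s) for s in seen]


sample = [
    "the cat sat on the mat",
    "a dog sat on the mat",
    "birds fly",
]
print(count_ending_ngrams(sample, k=3))    # [2, 2, 1, 1, 2]
print(count_ending_ngrams(sample, k=100))  # same result: k exceeds the sample size
```

Once k exceeds the number of available sentences, the slice `sentences[:k]` returns everything and the counts stop changing, which would explain identical rows for large K.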


Zipf's diagram for sentence endings

[Figure: Gnuplot diagram]
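
The data for a Zipf-style diagram of sentence endings could be derived along these lines. This is a minimal sketch under stated assumptions: the function name `zipf_ranks` is hypothetical, the sentence-final word is taken to be the last whitespace-separated token, and the original page may have ranked a different unit.

```python
from collections import Counter


def zipf_ranks(sentences):
    """Return (rank, frequency) pairs for sentence-final words, most frequent first.

    Hypothetical helper: the last whitespace-separated token is treated
    as the sentence ending; empty sentences are ignored.
    """
    counts = Counter(s.split()[-1] for s in sentences if s.split())
    ranked = sorted(counts.values(), reverse=True)
    return list(enumerate(ranked, start=1))


sample = ["it ends here", "all ends here", "nothing ends there", "stop"]
for rank, freq in zipf_ranks(sample):
    print(rank, freq)  # two columns, suitable as input for a plotting tool
```

Plotted with both axes on a logarithmic scale, data that follows Zipf's law falls roughly on a straight line.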

2336 msec needed at 2018-04-20 22:24